Statistics of "TRUE" and "FALSE" 


In a game of cricket the team that wins the toss gets to choose to bat or to bowl. When a
coin is tossed into the air, it lands showing one of the two faces, namely the head or the
tail. {Picture above left} If the coin is tossed a number of times, would the number of
“heads” be equal to the number of “tails”? We will ignore the very rare chance that the coin
actually stands on its edge. If they are equal, the coin is called “fair”. But how many
tosses should we count? Everyone knows that we do not get exactly two heads out of every
four tosses. So a few tosses are clearly not sufficient. A further question: for the coin
to be called fair, should the number of heads and tails be exactly equal or only
approximately equal? The bias of a coin is the number of heads divided by the number of
tails. The bias of a fair coin is 1. The number of heads and tails obtained after a large
number of tosses would be nearly equal. If the coin has a bias less than one there are more
tails than heads. For example a coin with a bias of 0.33 will show three tails for every
head. A coin with bias higher than 1 will show more heads. If the bias is 2, there will be
two heads for every tail.
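The definition of bias above can be tried out on a computer. The sketch below is only an illustration and is not taken from the book: the function names `head_probability` and `toss`, the seed and the number of tosses are all assumptions. It uses the fact that a bias of b (heads divided by tails) corresponds to a chance of b / (1 + b) for a head on any one toss.

```python
import random

def head_probability(bias):
    """Convert bias (heads divided by tails) into the chance of a head on one toss."""
    return bias / (1.0 + bias)

def toss(bias, n, seed=0):
    """Return (heads, tails) counted over n simulated tosses."""
    rng = random.Random(seed)  # fixed seed so that the run can be repeated
    p = head_probability(bias)
    heads = sum(rng.random() < p for _ in range(n))
    return heads, n - heads

# Biases used as examples in the text; 100,000 tosses is an arbitrary large number.
for b in (0.33, 1.0, 2.0):
    heads, tails = toss(b, 100_000)
    print(b, heads, tails, round(heads / tails, 2))
```

With a large number of tosses the last number printed on each line should come out close to the bias that was asked for.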


Coins with bias different from 1 may not actually exist. It is also very difficult to develop a 
skill to make the coin land head or tail as desired. But it is possible to prepare mathematical 
descriptions of biased coins and obtain the results of tossing on a computer. Such an 
examination is very useful for evaluating the strength of experimental results. In many 
experiments, the results are available not as a distribution with a mean, median and mode 
but as one of a pair; YES-NO; SUCCESS-FAILURE; RIGHT-WRONG etc. Often this is the 
result of simplification. For example the result of a blood test for a disease may state that
the patient IS suffering from a disease or that he or she is NOT suffering from the disease.
[...] is the reason we see the doctor [...]

The bias value from 100 tosses is 0.96, nearly one head for one tail. The results on




134 Science In Small Steps 


{Picture left top: results of 100 tosses of three coins. Coin 1: 28 heads, 72 tails, Bias = 0.39. Coin 2, FAIR: 49 heads, 51 tails, Bias = 0.96. Coin 3: 74 heads, 26 tails, Bias = 2.85.}

{Picture left bottom: the same results with 8 tosses highlighted. Coin 1: Bias (8 trials) = 1, True Bias = 0.39. Coin 2: Bias (8 trials) = 7, True Bias = 0.96. Coin 3: Bias (8 trials) = 1, True Bias = 2.85.}


the left can be expected from a coin with bias 0.39, {Picture left top}. There are between
two and three tails for each head. The results on the right can be expected if the bias of
the coin is 2.85, nearly three heads for each tail. These bias values were calculated using
all the 100 results.

In the second picture, {Picture left bottom} a small number of results are highlighted by a
dark colored rectangle. If the bias is calculated using the small number of results, the
coins on either side appear to be fair and the coin in the center appears to have a bias
of 7. To determine correct bias values, the number of results should be as large as
possible. Those who challenge modern science argue emotionally, using an extremely small
number of observations. The first lesson taught to scientists is to examine the results
impartially and not to become very emotional about the conclusions.
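How much a small number of results can mislead may be seen with a short simulation. This is a sketch under stated assumptions, not the method used for the pictures: the chance of a head (0.28, as for the coin with 28 heads in 100 tosses), the sample sizes 8 and 2000, the 200 repetitions and the name `head_fraction` are all chosen for illustration. The fraction of heads is used instead of the bias so that a small sample without any tails does not lead to a division by zero.

```python
import random

def head_fraction(p_head, n, rng):
    """Fraction of heads in n simulated tosses."""
    heads = sum(rng.random() < p_head for _ in range(n))
    return heads / n

rng = random.Random(1)   # fixed seed so that the run can be repeated
p_head = 0.28            # assumed chance of a head on one toss

# Repeat the experiment 200 times with 8 tosses and 200 times with 2000 tosses.
small = [head_fraction(p_head, 8, rng) for _ in range(200)]
large = [head_fraction(p_head, 2000, rng) for _ in range(200)]

spread_small = max(small) - min(small)
spread_large = max(large) - min(large)
print("spread with 8 tosses:   ", round(spread_small, 3))
print("spread with 2000 tosses:", round(spread_large, 3))
```

The estimates from 8 tosses scatter over a far wider range than those from 2000 tosses, although the coin is the same.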


{Picture left: Previous Toss, Next Toss}

{Picture left below: Coin 1, True Bias = 0.39}




If the first few results of a fair coin are heads, would the chance of a tail on the next
toss increase? No. To believe that the chance would increase is called gambler's fallacy.
{Picture left}

In the results of coin tosses shown earlier, some series of heads and tails have been
highlighted by rectangles of different colors. {Picture left below} In the results of the
fair coin, one example each of two to nine heads has been highlighted. A tail can follow two
heads or three heads. It can also follow nine heads. Strictly alternating heads and tails
are, however, very rare. The highlighted results show that the result of a future toss does
not depend on past results.

{Picture: Coin 2, True Bias = 0.96; Coin 3, True Bias = 2.85}

The results on the left are for a coin with a bias of 0.39. There are obviously more tails
than heads. But even here, a head follows after any number of tails. Series of two to ten
tails are all highlighted. But the chance of a head following a head is low. Similarly in
the results on the right, of a coin with a bias of 2.85, series with two to ten heads have
been highlighted. The bias determines the chance of the next toss being a head or a tail.
The immediately previous results do not. One has to remember that the coin has no way of
storing the results of the earlier tosses. The person tossing the coin may know the results
of earlier tosses but




unless there are facilities for cheating, the person cannot change 
the results obtained by tossing the coin later. 
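The claim that a run of heads does not change the next toss can be checked by simulation. The following is a minimal sketch and not part of the book: the function name `heads_after_run`, the seed and the 200,000 tosses are assumptions. It looks only at the tosses that come immediately after a run of heads and counts how often they are heads.

```python
import random

def heads_after_run(p_head, run_length, n_tosses, seed=0):
    """Fraction of heads among tosses that come right after `run_length` heads in a row."""
    rng = random.Random(seed)
    tosses = [rng.random() < p_head for _ in range(n_tosses)]  # True means head
    followers = [
        tosses[i]
        for i in range(run_length, n_tosses)
        if all(tosses[i - run_length:i])  # the previous `run_length` tosses were all heads
    ]
    return sum(followers) / len(followers)

# Fair coin: the chance of a head after 1, 3 or 5 heads in a row.
for run in (1, 3, 5):
    print(run, round(heads_after_run(0.5, run, 200_000), 3))
```

For a fair coin each printed fraction should stay close to 0.5 whatever the length of the run. The function needs enough tosses for the run to occur at least once, otherwise there is nothing to count.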





Gamblers use an argument that looks logical. On an average 50% of the results of a fair coin
should be heads. But the previous few tosses are far from the average. So the newer tosses
should move the results towards the mean. Newer results moving the average back towards the
mean is called regression to the mean. Knowing the difference between regression to the mean
and gambler's fallacy is very important while using scientific knowledge.


The dice with which children play usually has six sides with numbers 1 to 6 or from 1 to 6
dots. When the dice is thrown, one of the six faces will be on top. There are other types of 
dice. The ones in ancient India were thin and flat with a dot on one side. So either the side 
with the dot or the opposite side with no dot could be seen after throwing. Modern dice with 
a hundred sides numbered from 1 to 100 are also available. {Picture above} In the throw of 
a fair dice, all options are equally probable. 


Regression to the mean is easily explained using the hundred sided dice. If the result of the 
first throw is 73, the number obtained on the next throw of the dice is more likely to be a 
smaller number. But if the number obtained on the first throw is a small number like 13 the 
result of the next throw is more likely to be higher. So at first sight it appears that the 
numbers are somehow influenced by the past throw. This is called regression to the mean. 
A large number is followed by a small one, a small one by a large one. But just like a coin, 
there is no way a dice can store the results. So why is regression to the mean seen in dice 
throws but not coin tosses? 




In coin tosses we are not comparing the head and tail. We are counting the number of
heads and tails before comparing them. In the example of the dice, we are comparing the
numbers on the dice. That is the difference. When the hundred sided dice is thrown, any
number from 1 to 100 is equally likely. But there are only 27 numbers larger than 73 and 72
smaller ones. Obviously a smaller number is more likely. There is a small chance that 73 is
followed by 87. But then the chance of getting a smaller number in the next throw is even
higher. Only 13 numbers are larger than 87.
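The counting in the paragraph above can be written out in a few lines. This is only an illustrative sketch; the function name `smaller_and_larger` is an assumption. Since every face of a fair hundred sided dice is equally likely, the counts are also the chances in percent.

```python
def smaller_and_larger(first_throw, sides=100):
    """Count the faces below and above the number just thrown."""
    smaller = first_throw - 1
    larger = sides - first_throw
    return smaller, larger

# The first throws mentioned in the text.
for first in (13, 73, 87):
    smaller, larger = smaller_and_larger(first)
    print(first, smaller, larger)
```

The counts depend only on the number just thrown and on the number of faces. Nothing about earlier throws enters the calculation.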


The relationship between the heights of fathers and sons is a good example of regression to
the mean. Sons of a tall father are usually shorter than he is. The probable height of the
son could be anywhere in the Gaussian distribution described earlier. But the father is
taller than the average of the Gaussian. So the fraction of people in the Gaussian
distribution {Picture left: "Typical" and "Tall Father" marked on the distribution} taller
than the father will be less than half. So the chances of a son being shorter are also
higher. The comparison with the dice with a hundred sides is quite obvious. But there is a
difference. The son could also be tall because he inherited the genes from his father.
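The fraction of a Gaussian distribution that lies above a given height can be computed directly. The numbers here are assumptions made only for illustration and do not come from the book: an average height of 170 cm and a spread (standard deviation) of 7 cm. The sketch also leaves out inheritance, which the text points out as the difference from the dice.

```python
from statistics import NormalDist

# Hypothetical numbers, not from the text: mean height 170 cm, spread 7 cm.
heights = NormalDist(mu=170, sigma=7)

def fraction_taller(height_cm):
    """Fraction of the distribution that lies above height_cm."""
    return 1.0 - heights.cdf(height_cm)

# A father of average height and two taller fathers.
for father in (170, 178, 185):
    print(father, round(fraction_taller(father), 3))
```

The taller the father, the smaller the fraction of the distribution above him, just as fewer faces of the dice lie above a large number.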


A wrong example for regression to the mean can be seen in sports news. The critics will 
note that the team has unexpectedly lost three of four games and so is more likely to win 
because of regression to the mean. But the chance of a team winning is similar to the bias 
of a coin. It is determined by the results of a large number of games. That won’t change 
with the past few results. One can find psychological arguments for either winning or losing 
the next game. The losses may create despair leading to another loss or a determination to 
win. But the result has nothing to do with regression to the mean. 




Another important topic in this discussion of the statistics of the results has to be
baseline fallacy. As an example consider once again the blood test with a ten percent chance
of error. Suppose the test has been employed over a large population. What percentage of
people get a wrong result? That percentage will not always be 10%. To understand why this is
so, consider the results of the coins with biases 0.96 and 0.39 again. The picture shows the
results with a small modification. 10% of the heads are wrongly marked as tails and 10% of
the tails are wrongly marked as heads. These have been highlighted.

{Picture left: fair coin, "Error 10% No Change", heads 5 False and 44 True, tails 5 False and 46 True; coin with bias 0.39, "Error 10% Bias Change", heads 7 False and 25 True, tails 3 False and 65 True}

This assumes that there is a 10% chance of recording the tosses wrongly, perhaps because the
light was insufficient for proper observation. For both coins, the error is 10%. The bias of
the fair coin has not changed. The bias of the other coin has changed from 0.39 to 0.47.
Why? In the results of the fair coin, 5 heads became tails and 5 tails became heads. The
bias value remains 0.96. But in the case of the second coin there are more tails. So 7 tails
became heads and only 3 heads became tails. In practice the error in the results depends not
only on the error percentage but also on the bias. This is the cause of baseline fallacy.
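The arithmetic behind the change from 0.39 to 0.47 can be followed step by step. The sketch below is an illustration and not the procedure used for the picture: the function name `recorded_counts` is an assumption, and the number of misread results is rounded to a whole toss, which reproduces the 5 and 5, and the 3 and 7, quoted in the text.

```python
def recorded_counts(heads, tails, error_percent):
    """Counts of heads and tails as written down when some results are misread."""
    heads_misread = round(heads * error_percent / 100)  # true heads noted as tails
    tails_misread = round(tails * error_percent / 100)  # true tails noted as heads
    seen_heads = heads - heads_misread + tails_misread
    seen_tails = tails - tails_misread + heads_misread
    return seen_heads, seen_tails

# The fair coin (49 heads, 51 tails) and the coin with bias 0.39 (28 heads, 72 tails).
for heads, tails in ((49, 51), (28, 72)):
    seen_heads, seen_tails = recorded_counts(heads, tails, 10)
    print(heads, tails, seen_heads, seen_tails, round(seen_heads / seen_tails, 2))
```

The same 10% error leaves the counts of the fair coin as they were but moves the other coin from 28 and 72 to 32 and 68, because the more common result contributes more wrong readings.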


Similarly, the part of the population that gets a wrong diagnosis is not a fixed percentage.
If the blood test was for detecting anaemia in poor people, there is a good chance that 50% of the




[...]






of being wrong. So the doctor keeps adding symptoms and tests. The question asked at each
step is the same. What is the chance for the patient to have this symptom or for this test
to give this positive result without the patient actually suffering from the disease?
{Picture left: Test Result 1, Symptom and Test Result 2, each with its chance of a false positive, leading to the Final Diagnosis of Disease X}
And finally, what is the probability that all these symptoms and all the positive test
results could simultaneously be present without the patient suffering from the disease? The
doctor is helped by every symptom and test result strengthening the others. Every one of
these must be independent. They join in parallel. When the disease is extremely dangerous, a
second opinion is sought. Another parallel addition. This method, called Bayesian
statistics, was discovered by the 18th century English statistician Bayes.
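The final question asked by the doctor can be put into numbers when the findings are independent, as the text requires. This is a sketch of that one step only and not a complete Bayesian calculation, which would also need to know how common the disease is. The three chances of 10% and the function name `chance_all_false` are assumptions made for illustration.

```python
import math

def chance_all_false(false_positive_chances):
    """Chance that every independent finding shows up although the disease is absent."""
    return math.prod(false_positive_chances)

# Hypothetical chances of a symptom or positive test appearing without the disease.
findings = [0.10, 0.10, 0.10]
for count in range(1, len(findings) + 1):
    print(count, chance_all_false(findings[:count]))
```

Each added independent finding multiplies the chance of a wrong conclusion by a number smaller than one, which is how the findings strengthen each other.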


When the additional expenditure of materials and money is justified to reduce the chance
of failure, similar parallel support is employed in other areas. For example, an airplane is
designed to work even if one of its two engines fails. There are two pilots in every plane
though one is sufficient.


In simple machines like a wheel, axle or pulley, there are very few parts. As the machine 
becomes more complex, the number of parts increases. The machine fails to work if any of 
the parts fails. This has been shown as a chain of toothed wheels. {Picture opposite page} 




{Picture: chains of toothed wheels. Individual failure rate 1%: system failure rate 10%, from 1 - (0.99 x 0.99 x ...). Individual failure rate 1% for nine units and 10% for one: system failure rate 18%. Individual failure rate 10%: system failure rate 65%.}





When the first wheel rotates, so will the last. But only if each and every toothed wheel in 
the middle also rotates. The last will stop if any of the intermediate wheels fails. This is a 
serial or chain like design. If the design includes parallel alternatives for each of them, the 
materials required for making the machine and the energy required to operate it increase 
making it too costly and unusable. How is the probability of failure of the complete machine 
related to the probability of failure of the individual units in series? If the chance of failure of 




each of the toothed wheels is 1%, is the chance of failure of the machine with ten wheels
also 1%? No. The first wheel works 99% of the time. The second also works 99% of the time,
so both work together 99% of 99% of the time. So to get the chance of both working, we need
to multiply the numbers. Since there are ten such wheels, to get the chance of the complete
machine working we need to multiply ten times. The chance of failure is what remains, in
this case about 10%. If just one of the wheels has a 10% chance of failure, rather than 1%
like the others, the machine failure rate is 18%. If every wheel has a 10% chance of failure
the machine would function only 35% of the time. When the parts are in a chain, they do not
reinforce each other. One part cannot perform the job of another that has failed. So in a
machine with a large number of parts, individual efficiencies have to be extremely high to
get a reasonable overall efficiency.
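The three failure rates quoted above can be recomputed with a few lines. This is a minimal sketch, assuming as the text does that the wheels fail independently of each other; the function name `system_failure` is an assumption.

```python
import math

def system_failure(failure_chances):
    """Chance that a chain fails when it needs every one of its parts to work."""
    all_work = math.prod(1.0 - p for p in failure_chances)  # every part must work
    return 1.0 - all_work

# Ten wheels at 1%, nine wheels at 1% with one at 10%, and ten wheels at 10%.
print(round(system_failure([0.01] * 10), 2))
print(round(system_failure([0.01] * 9 + [0.10]), 2))
print(round(system_failure([0.10] * 10), 2))
```

Adding more parts to the list, or raising the failure chance of any single part, can only increase the result, which is why long chains need very reliable parts.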


The saying, "A chain is as strong as the weakest link" recognizes this truth. The folk saying 
that "A rope of grass could easily tie an elephant" recognizes the strength of parallel 
reinforcement. Science converts such ideas into mathematical relations and numbers. Only 
then is the design of machines possible. 


